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Waiting algorithms for synchronization in large-scale multiprocessors 
Beng-Hong Lim, Anant Agarwal 

August 1993 ACM Transactions on Computer Systems (TOCS), volume n issue 3 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: ^ gpdf(2.72 MB) 



Through analysis and experiments, this paper investigates two-phase waiting algorithms to 
minimize the cost of waiting for synchronization in large-scale multiprocessors. In a two- 
phase algorithm, a thread first waits by polling a synchronization variable. If the cost of 
polling reaches a limit Lpoll and further waiting is necessary, the thread is blocked, incurring 
an additional fixed cost, B. The choice of Lpoll 

Keywords: barriers, blocking, competitive analysis, locks, producer-consumer 
synchronization, spinning, waiting time 



2 Shared memory objects: An almost non-blocking stack 
Hans-J. Boehm 

July 2004 Proceedings of the twenty-third annual ACM symposium on Principles of 
distributed computing r 

Full text available: ^ pdf(1 74.83 KB) Additional Information: full citation , abstract, references , index terms 

Non-blocking data structure implementations can be useful for performance and fault- 
tolerance reasons. And they are far easier to use correctly in a signal- or interrupt-handler 
context.We describe a weaker class of "almost non-blocking" data structures, which block 
only if more than some number N of threads attempt to simultaneously access the same 
data structure. We argue that this gives much of the benefit of fully non-blocking data 
structures, particularly for signal or interrupt hand ... 

Keywords: compare-and-swap, interrupt handler, linked list, lock-free, memory allocation, 
non-blocking, signal handler, stack 
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3 Implementation of resilient, atomic data ty pes 
William Weihl, Barbara Liskov 

April 1985 ACM Transactions n Programming Languages and Systems (TOPLAS), 

Volume 7 Issue 2 

Full text available: fi3 pdf (2.19 MB) Additional Information: full citation, abstract , references , citing s, index 

' terms , review 

A major issue in many applications is how to preserve the consistency of data in the 
presence of concurrency and hardware failures. We suggest addressing this problem by 
implementing applications in terms of abstract data types with two properties: Their objects 
are atomic (they provide serializability and recoverability for activities using them) and 
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resilient (they survive hardware failures with acceptably high probability). We define what it 
means for abstract data types to be atomic and ... 

4 S pecification and implementation of resilient, atomic data ty pes Q 
William Weihl, Barbara Liskov 

June 1983 Proceedings of the 1983 ACM SIGPLAN symp sium on Programming 
language issues in software systems 

Full text available* IS pdf (1 28 MB) Additional Information: full citation , abstract , references , citings, index 
- \M * terms 

A major issue in many applications is how to preserve the consistency of data in the 
presence of concurrency and hardware failures. We suggest addressing this problem by 
implementing applications in terms of abstract data types with two properties: Their objects 
are atomic (they provide serializability and recoverability for activities using them) and 
resilient (they survive hardware failures with acceptably high probability). We define what it 
means for abstract data types to be atomic and ... 

5 Im plementation of Argus Q 
B. Liskov, D. Curtis, P. Johnson, R. Scheifer 

November 1987 ACM SIGOPS Operating Systems Review , Proceedings of the eleventh 

ACM Symposium on Operating systems principles, volume 21 issue 5 
Full text available: 1p| pdf(1.34 MB) Additional Information: full citation , abstract , references , citings , index 
' ^ terms 

Argus is a programming language and system developed to support the construction and 
execution of distributed programs. This paper describes the implementation of Argus, with 
particular emphasis on the way we implement atomic actions, because this is where Argus 
differs most from other implemented systems. The paper also discusses the performance of 
Argus. The cost of actions is quite reasonable, indicating that action systems like Argus are 
practical. 

6 Reliable object storage to support atomic actions Q 
Brian M. Oki, Barbara H. Liskov, Robert W. Scheifler 

December 1985 ACM SIGOPS Operating Systems Review , Proceedings of the tenth ACM 

symposium on Operating systems principles, volume 19 issue 5 
Full text available: ^j| pdf(939.13 KB) Additional Information: full citation , references , citings , index terms 



7 The linked class of Modula-3 
E. Levy 

August 1988 ACM SZGPLAN Notices, volume 23 issues 

Full text available: |fj pdf(732.03 KB) Additional Information: full citation , abstract , index terms 

The desirability of providing built-in linked list support for the next generation of system 
programming languages is discussed. A Modula-2 pseudo inheritance implementation is 
given that demonstrates a proposed set of primitives. A set of rules are specified that 
enhance reliability without adding significant restrictions. Benefits to concurrency control are 
described. 

8 Guardians and actions: linguistic support for robust, distributed p ro grams 
Barbara Liskov, Robert Scheifler 

January 1982 Proceedings of the 9th ACM SIGPLAN-SIGACT symposium on Principles 
programming languages 

Full text available: ^ pdf(1.34 MB) Additional Information: full citation , abstract , references , citings 

This paper presents an overview of an integrated programming language and system 
designed to support the construction and maintenance of distributed programs: programs in 
which modules reside and execute at communicating, but geographically distinct, nodes. The 
language is intended to support a class of applications in which the manipulation and 
preservation of long-lived, on-line, distributed data is important. The language addresses 
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the writing of robust programs that survive hardware failures ... 

9 Introduction to Demos Q 
Graham Birtwistle 

December 1981 Proceedings f the 13th conference n Winter simulation - Volume 2 

Full text available* fi3pdf(1.11 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

Demos [1,2] is yet another discrete event simulation language hosted in Simula. It was 
released in 1979 and is now running on IBM, DEC, UNIVAC, and CDC hardwares amongst 
others. The paper contains a short introduction to Simula's object and context features; an 
explanation of the process approach to simulation; a brief comparison of Simula and GPSS; 
and finally, the main features of Demos are presented via an example. 

10 Session: Modula-3 network objects over ANSA: hetero g eneous object-based RPC in a Q 
modern systems programming langua ge 

David Evers, Peter Robinson 

September 1992 Proceedings of the 5th workshop on ACM SIGOPS European workshop: 
Models and paradigms for distributed systems structuring 

Full text available: pdf(453.29 KB) Additional Information: full citation , abstract , references 

Modula-3 provides language-level features such as threads, objects and exceptions which 
are useful in distributed systems. The ANSA testbench provides a complete infrastructure for 
object-based distributed systems, but currently requires the use of C as the main 
programming language. We describe a successful attempt to marry the two, which provides 
a practical example of how a modern systems programming language can make the 
construction of object-based distributed systems more congenial for the ... 

11 Experience Using Multiprocessor Systems — A Status Report Q 
Anita K. Jones, Peter Schwarz 

June 1980 ACM Computing Surveys (CSUR), volume 12 issue 2 

Full text available: ^pdf(4.48 MB) Additional Information: full citation , references , citings , index terms 



12 A foundation for an efficient multi-threaded scheme system 
Suresh Jagannathan, Jim Philbin 

January 1992 ACM SIGPLAN Lisp Pointers , Proceedings of the 1992 ACM conference on 

LISP and functional programming, volume v issue 1 
Full text available: 151 pdf (1.19MB) Additional Information: full citation, abstract , references, citings, index 
' ^ ' terms 

We have built a parallel dialect of Scheme called STING that differs from its contemporaries 
in a number of important respects. STING is intended to be used as an operating system 
substrate for modern parallel programming languages. The basic concurrency management 
objects is STING are first-class lightweight threads of control and virtual processors (VPs). 
Unlike high-level concurrency structures, STING threads and VPs are not encumbered by 
complex synchronization protocols. ... 

13 Techniq ues for obtaining high performance in Java pro g rams 
Iffat H. Kazi, Howard H. Chen, Berdenia Stanley, David J. Lilja 
September 2000 ACM Computing Surveys (CSUR), volume 32 issue 3 

Full text available* fitj| pdf(816 13 KB) Additional Information: full citation , abstract , references , citings , index 

terms 

This survey describes research directions in techniques to improve the performance of 
programs written in the Java programming language. The standard technique for Java 
execution is interpretation, which provides for extensive portability of programs. A Java 
interpreter dynamically executes Java bytecodes, which comprise the instruction set of the 
Java Virtual Machine (JVM). Execution time performance of Java programs can be improved 
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Keyw rds: Java, Java virtual machine, bytecode-to-source translators, direct compilers, 
dynamic compilation, interpreters, just-in-time compilers 



14 E-commerce: SweetDeal: representing a gent contracts with exceptions using XML 
rules, ontologies, and process descriptions 
Benjamin N. Grosof, Terrence C. Poon 

May 2003 Proceedings of the twelfth international conference on World Wide Web 

Full text available: Q pdf(481.94 KB) Additional Information: full citation , abstract , references , index terms 

SweetDeal is a rule-based approach to representation of business contracts that enables 
software agents to create, evaluate, negotiate, and execute contracts with substantial 
automation and modularity. It builds upon the situated courteous logic programs knowledge 
representation in RuleML, the emerging standard for Semantic Web XML rules. Here, we 
newly extend the SweetDeal approach by also incorporating process knowledge descriptions 
whose ontologies are represented in DAML+OIL (emerging standa ... 

Keywords: DAML+OIL, OWL, RDF, XML, business process automation, declarative, 
description logic, electronic commerce, electronic contracts, intelligent software agents, 
knowledge representation, knowledge-based, logic programs, ontologies, process 
descriptions, process knowledge, rules, semantic web, semantic web services, web services 



1 5 Growing languages with metamorphic syntax macros | 
Claus Brabrand, Michael I. Schwartzbach 

January 2002 ACM SIGPLAN Notices , Proceedings of the 2002 ACM SIGPLAN workshop 
on Partial evaluation and semantics-based program manipulation, volume 

37 Issue 3 

Full text available: pelf (2 17. 81 KB) Additional Information: full citation , abstract , references , citings 

"From now on, a main goal in designing a language should be to plan for growth. " Guy 
Steele: Growing a Language, OOPSLA '98 invited talk.VJe present our experiences with a 
syntax macro language which we claim forms a general abstraction mechanism for growing 
(domain-specific) extensions of programming languages. Our syntax macro language is 
designed to guarantee type safety and terminations concept of metamorphisms allows the 
arguments of a macro to be inductively def ... 

16 EMERALDS: a small-memory real-time microkernel | 
Khawar M. Zuberi, Padmanabhan Pillai, Kang G. Shin 

December 1999 ACM SIGOPS Operating Systems Review , Proceedings of the 

seventeenth ACM symposium on Operating systems principles, volume 33 
Issue 5 

Full text available- ID Ddfd 59 MB) Additional Information: full citation , abstract , references , citings, index 
' ^ terms 

EMERALDS (Extensible Microkernel for Embedded, ReAL-time, Distributed Systems) is a 
real-time microkernel designed for small-memory embedded applications. These applications 
must run on slow (15-25MHz) processors with just 32-128 kbytes of memory, either to keep 
production costs down in mass-produced systems or to keep weight and power consumption 
low. To be feasible for such applications, the OS must not only be small in size (less than 20 
kbytes), but also have low-overhead kernel services. Un ... 

17 Network objects | 
Andrew Birrell, Greg Nelson, Susan Owicki, Edward Wobber 

December 1993 ACM SIGOPS Operating Systems Review , Proceedings of the 

fourteenth ACM symposium n Operating systems principles, volume 27 

Issue 5 

Full text available* fjEl pdf(1.35 MB ) Additional Information: full citation , abstract , references , citings , index 
. L£d M terms 
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A network object is an object whose methods can be invoked over a network. This paper 
describes the design, implementation, and early experience with a network objects system 
for Modula-3. The system is novel for its overall simplicity. The paper includes a thorough 
description of realistic marshaling algorithms for network objects. 

18 Analyzing communication latency using the Nectar communication processor 
Peter Steenkiste 

October 1992 ACM SIGCOMM Computer Communication Review , C nference 

proceedings on Communications architectures & protocols, volume 22 issue 4 
Full text available* 13 odfd 16MB) Additional Information: full citation , abstract , references , citings , index 



terms 

For multicomputer applications, the most important performance parameters of a network is 
the latency for short messages. In this paper we present an analysis of communication 
latency using measurement of the Nectar system. Nectar is a high-performance 
multicomputer built around a high-bandwidth crosspoint network. Nodes are connected to 
the Nectar network using network coprocessors that are primarily responsible the protocol 
processing, but that can also execute application code. This arch ... 



19 CPU reservations and time constraints: efficient, predictable scheduling of independent Q 
activities 

Michael B. Jones, Daniela Ro$u, Marcel-Catalin Ro§u 

October 1997 ACM SIGOPS Operating Systems Review , Proceedings of the sixteenth 

ACM symposium on Operating systems principles, volume 31 issue 5 
Full text available: ^ pdf(2.25 MB) Additional Information: full citation , references , citings , index terms 



20 Automatic performance setting for dynamic voltage scaling 
Krisztian Flautner, Steve Reinhardt, Trevor Mudge 
September 2002 Wireless Networks, volume 8 issue 5 

Full text available: ^ pdf(328.69 KB) Additional Information: full citation , abstract , references , index terms 

The emphasis on processors that are both low power and high performance has resulted in 
the incorporation of dynamic voltage scaling into processor designs. This feature allows one 
to make fine granularity tradeoffs between power use and performance, provided there is a 
mechanism in the OS to control that tradeoff. In this paper, we describe a novel software 
approach to automatically controlling dynamic voltage scaling in order to optimize energy 
use. Our mechanism is implemented in the Linux ke ... 

Keywords: dynamic voltage scaling, interactive performance, performance-setting, power 
management, response time 
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21 Firefly: a multiprocessor workstation 
Charles P. Thacker, Lawrence C. Stewart 
October 1987 Proceedings of the second international conference on Architectual 

support for programming languages and operating systems, volume is , 22 , 

21 Issue 5 , 10 , 4 

Full text available: |S|pdf(1.10 MB) Additional Information: full citation , abstract , citings , index terms 

Firefly is a shared-memory multiprocessor workstation that contains from one to seven 
MicroVAX 78032 processors, each with a floating point unit and a sixteen kilobyte cache. 
The caches are coherent, so that all processors see a consistent view of main memory. A 
system may contain from four to sixteen megabytes of storage. Input-output is done via a 
standard DEC QBus. Input-output devices are an Ethernet controller, fixed disks, and a 
monochrome 1024 x 768 display with keyboard and mouse. Option ... 
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1 Memory system analysis and optimization: Dynamic tracking of page miss ratio curve Q 
for memory mana g ement 

Pin Zhou, Vivek Pandey, Jagadeesan Sundaresan, Anand Raghuraman, Yuanyuan Zhou, 
Sanjeev Kumar 

October 2004 Proceedings of the 11th international conference on Architectural 
support for programming languages and operating systems 

Full text available: |j| pdf (281.38 KB) Additional Information: full citation , abstract , references , index terms 

Memory can be efficiently utilized if the dynamic memory demands of applications can be 
determined and analyzed at run-time. The page miss ratio curve(MRC), i.e. page miss rate 
vs. memory size curve, is a good performance-directed metric to serve this purpose. 
However, dynamically tracking MRC at run time is challenging in systems with virtual 
memory because not every memory reference passes through the operating system 
(OS).This paper proposes two methods to dynamically track MRC of application ... 



Keywords: memory management, power management, resource allocation 



2 Im plementation and performance of inte g rated a p plication-controlled file cachin g , 
prefetchin g , and disk scheduling 
Pei Cao, Edward W. Felten, Anna R. Karlin, Kai Li 

November 1996 ACM Transactions on Computer Systems (TOCS), volume 14 issue 4 

Full text available: fl pdf(609.00 KB) Additional Information: full citation , abstract, references , dfiogs, index 

terms 

As the performance gap between disks and micropocessors continues to increase, effective 
utilization of the file cache becomes increasingly immportant. Application-controlled file 
caching and prefetching can apply application-specific knowledge to improve file cache 
management. However, supporting application-controlled file caching and prefetching is 
nontrivial because caching and prefetching need to be integrated carefully, and the kernel 
needs to allocate cache blocks among processes ap ... 

Keywords: application-controlled resource management, disk scheduling, file caching, file 
prefetching 



3 Distance Associativity for High-Performance Energy-Efficient Non-Uniform Cache 
Architectures 

Zeshan Chishti, Michael D. Powell, T. N. Vijaykumar 

December 2003 Proceedings of the 36th Annual IEEE/ACM Internati nal Symposium n 
M icroa rch itect u re 
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Full text available: fi]| pdf(189.85 KB) Additional Information: full citation , abstract , index terms 
Publisher Site 

Wire delays continue to grow as the dominant component oflatency for large caches. A 
recent work proposed an adaptive^on-uniform cache architecture (NUCA) to manage large, 
on-chipcaches.By exploiting the variation in access time acrosswidely-spaced subarrays, 
NUCA allows fast access to closesubarrays while retaining slow access to far 
subarrays.Whilethe idea of NUCA is attractive, NUCA does not employ designchoices 
commonly used in large caches, such as sequential tag-dataaccess for low power.Mo ... 

Trace-driven memory simulation: a survey 
Richard A. Uhlig, Trevor N. Mudge 

June 1997 ACM Computing Surveys (CSUR), volume 29 issue 2 

Full text available* fgl pdf(636 11 KB) Additional Information: full citation , abstract , references , citings , index 
• : terms, review 

As the gap between processor and memory speeds continues to widen, methods for 
evaluating memory system designs before they are implemented in hardware are becoming 
increasingly important. One such method, trace-driven memory simulation, has been the 
subject of intense interest among researchers and has, as a result, enjoyed rapid 
development and substantial improvements during the past decade. This article surveys and 
analyzes these developments by establishing criteria for evaluating trac ... 

Keywords: TLBs, caches, memory management, memory simulation, trace-driven 
simulation 



5 A high-level abstraction of shared accesses 
Peter J. Keleher 

February 2000 ACM Transactions on Computer Systems (TOCS), Volume 18 issue i 

Full text available* fiQ pdf (183 57 KB) Add '^ orial Information: full citation , abstract, references , index terms . 
■ _ review 

We describe the design and use of the tape mechanism, a new high-level abstraction of 
accesses to shared data for software DSMs. Tapes consolidate and generalize a number of 
recent protocol optimizations, including update-based locks and recorded-replay barriers. 
Tapes are usually created by "recording" shared accesses. The resulting recordings can be 
used to anticipate future accesses by tailoring data rjiovement to application semantics. 
Tapes-based mechanisms a ... 

Keywords: DSM, programming libraries, shared memory, update protocols 



6 A framework for efficient reuse of binary code in Java 
Pramod G. Joisha, Samuel P. Midkiff, Mauricio J. Serrano, Manish Gupta 
June 2001 Proceedings of the 15th international conference on Supercomputing 

Full text available: ^ pdf(419.49 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents a compilation framework that enables efficient sharing of executable 
code across distinct Java Virtual Machine (JVM) instances. High-performance JVMs rely on 
run-time compilation, since static compilation cannot handle many dynamic features of Java. 
These JVMs suffer from large memory footprints and high startup costs, which are serious 
problems for embedded devices (such as hand held personal digital assistants and cellular 
phones) and scalable servers. A recently propose ... 

7 Revisitation patterns in World Wide Web navigation 
Linda Tauscher, Saul Greenberg 

March 1997 Proceedings of the SIGCHI conference n Human fact rs in c mputing 
systems 

Full text available: ^ pdf(984.35 KB) Additional Information: full citation , references , citings , index terms 
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8 Flexible use of memory for replication/migration in cache-coherent DSM 
multiprocessors 

Vijayaraghavan Soundararajan, Mark Heinrich, Ben Verghese, Kourosh Gharachorloo, Anoop 
Gupta, John Hennessy 

April 1998 ACM SIGARCH Computer Architecture News , Proceedings of the 25th 

annual international symposium on Computer architecture, volume 26 issue 3 
Full text available: gjp^f ^ 7 g M b) ^ Additional Information: full citation , abstract , references , citings , index 
Publisher Site ^s 

Given the limitations of bus-based multiprocessors, CC-NUMA is the scalable architecture of 
choice for shared-memory machines. The most important characteristic of the CC-NUMA 
architecture is that the latency to access data on a remote node is considerably larger than 
the latency to access local memory. On such machines, good data locality can reduce 
memory stall time and is therefore a critical factor in application performance. In this paper 
we study the various options available to system desi ... 

9 Hive: fault containment for shared-memory multiprocessors 

J. Chapin, M. Rosenblum, S. Devine, T. Lahiri, D. Teodosiu, A. Gupta 

December 1995 ACM SIGOPS Operating Systems Review , Proceedings of the fifteenth 

ACM symposium on Operating systems principles, volume 29 issue 5 
Full text available: -Q pdf(1.90 MB) Additional Information: full citation , references , citings , index terms 



10 Contention elimination by replication of sequential sections in distributed shared 
memory programs 

Honghui Lu, Alan L. Cox, Willy Zwaenepoel 

June 2001 ACM SIGPLAN Notices , Proceedings of the eighth ACM SIGPLAN symposium 
on Principles and practices of parallel programming, volume 36 issue i 

Full text available* 1 ?Hpdf(173 49 KB) Additiona ' Information: full citation , abstract , references , citings, index 
■ stt* 1 — ' terms 

In shared memory programs contention often occurs at the transition between a sequential 
and a parallel section of the code. As all threads start executing the parallel section, they 
often access data just modified by the thread that executed the sequential section, causing 
a flurry of data requests to converge on that processor. We address this problem in a 
software distributed shared memory system by replicating the execution of the sequential 
sections on all pr ... 

11 Combined performance g ains of simple cache protocol extensions 
F. Dahlgren, M. Dubois, P. Stenstrom 

April 1994 ACM SIGARCH Computer Architecture News , Proceedings of the 21ST 

annual international symposium on Computer architecture, volume 22 issue 2 
Full text available* 13 odf(1 22 MB) Additional Information: full citation , abstract , references , citings, index 
'^^^ terms 

We consider three simple extensions to directory- based cache coherence protocols in 
shared-memory multiprocessors. These extensions are aimed at reducing the penalties 
associated with memory accesses and include a hardware prefetching scheme, a migratory 
- sharing optimization, and a competitive-update mechanism. Since they target different 
components of the read and write penalties, they can be combined effectively. Detailed 
architectural simulations using five benchmarks show substantial combined ... 

12 An integrated memory management scheme for dynamic alias resolution 
Tzi-cker Chiueh 
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August 1991 Proceedings of the 1991 ACM/IEEE conference on Supercomputing 

Full text available: Q pdf(944,90 KB) Additional Information: full citation , references , citings , index terms 



1 3 Translating collocations for bilingual lexicons: a statistical approach 
Frank Smadja, Kathleen R. McKeown, Vasileios Hatzivassiloglou 

March 1996 C mputational Linguistics, volume 22 issue 1 

Full text available:^, ff| 

pdf(2.83 MBjjra* 1 Additional Information: full citation , abstract , references , citings 
Publisher Site 

Collocations are notoriously difficult for non-native speakers to translate, primarily because 
they are opaque and cannot be translated on a word-by-word basis. We describe a program 
named Champollion which, given a pair of parallel corpora in two different languages and a 
list of collocations in one of them, automatically produces their translations. Our goal is to 
provide a tool for compiling bilingual lexical information above the word level in multiple 
languages, for 'different domains. The a ... 

14 Compiler-b ased I/O prefetchin g for out-of-core a p plications 
Angela Demke Brown, Todd C. Mowry, Orran Krieger 

May 2001 ACM Transactions on Computer Systems (TOCS), volume 19 issue 2 

Full text available* fiD pdf (499.03 KB) Adclitional Information: full citation , abstract , references , citings, index 
. [aj * terms , review 

Current operating systems offer poor performance when a numeric application's working set 
does not fit in main memory. As a result, programmers who wish to solve "out-of-core" 
problems efficiently are typically faced with the onerous task of rewriting an application to 
use explicit I/O operations (e.g., read/write). In this paper, we propose and evaluate a fully 
automatic technique which liberates the programmer from this task, provides high 
performance, and requires only minima ... 

Keywords: compiler optimization, prefetching, virtual memory 
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Methods of storing collections of related data in memory can 
include storing a first collection of related data in a first 
collection of tagged data pieces and storing a second col- 
lection of related data in a second collection of tagged data 
pieces. Each tagged data piece of the first collection can 
include a respective header including a first tag identification 
common to each of the tagged data pieces of the first 
collection and a fragment identification unique to each of the 
tagged data pieces of the first collection. Similarly, each 
tagged data piece of the second collection can include a 
respective header including a second tag identification com- 
mon to each of the tagged data pieces of the second 
collection and a fragment identification unique to each of the 
tagged data pieces of the second collection wherein the first 
and second tag identifications are distinct. Related systems 
and computer program products are also discussed. 
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ABSTRACT : 

Methods of storing collections of related data in memory can include storing a 
first collection of related data in a first collection of tagged data pieces 
and storing a second collection of related data in a second collection of 
tagged data pieces. Each tagged data piece of the first collection can include 
a respective header including a first tag identification common to each of the 
tagged data pieces of the first collection and a fragment identification unique 
to 'each of the tagged data pieces of the first collection. Similarly, each 
tagged data piece of the second collection can include a respective header 
including a second tag identification common to each of the tagged data pieces 
of the second collection and a fragment identification unique to each of the 
tagged data pieces of the second collection wherein the first and second tag 
identifications are distinct. Related systems and computer program products 
are also discussed. 



Detail Description Paragraph - DETX (82) : 

[0108] When a tagged data piece of a collection with a corresponding 
fragment list saved, is relocated as a result of sector reclamation, or is 
updated with new data, the new memory device address for the tagged data piece 
is entered into the fragment list (in place of the old memory device address 
for the fragment) in fragment identification order. The fragment list can be 
maintained in dynamic memory of the memory controller 33 as long as the data 
collection corresponding to the fragment list is being used by the processor 
35. Once the data collection is no longer being used or is not expected to be 
used in the near future, the fragment list for the data collection can be 
de- allocated or removed from dynamic memory 34 . 

Detail Description Paragraph - DETX (84) : 

[0110] A pointer for the all list 331 of FIG. 20 can also be maintained to 
identify the last fragment list accessed by the memory controller. If the 
memory controller has last accessed data collection B, the pointer can be used 
to identify the location of FragBLst in the all list. If the next request for 
a data collection is for data collection B, the memory controller 33 can 
quickly locate the fragment list for data collection B using' the pointer and 
the all list 331. Otherwise, the memory controller can search the all list for 
the tag identification for the data collection to be accessed. If a fragment 
list for the data collection is currently open, the search of the all list by 
tag identification will allow the memory controller to quickly locate the 
dynamic memory address of the corresponding fragment list. If a fragment list 
for the data collection is not currently open, the tag identification will not 
be located in the all list. Accordingly, the memory controller can get the 
address for the sorted list from the all list, locate the tag identification in 
the sorted list, and then generate a fragment list for the desired data 
collection. 

Detail Description Paragraph - DETX (85) : 

[0111] In other words, the pointer and the all list can be used to quickly 
locate open fragment lists for previously compiled data collections and/or to 
generate a fragment list if a fragment list is not currently opened, when data 
of a collection is to be retrieved, the memory controller can first check the 
last accessed tag identification designated by the all list pointer . Because a 
data collection may be accessed repeatedly, there is a relatively high 
probability that the last accessed data collection may be the same as the data 
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collection to be accessed. Accordingly, access time may be saved by first 
checking the last accessed data collection. If the tag identification of the 
last accessed data collection matches the tag identification of the data 
collection to be accessed, the memory controller can use the corresponding 
dynamic memory address to locate the appropriate fragment list. 

Detail Description Paragraph - DETX (90) : 

[0116] A collection of tagged data pieces can be located according to 
embodiments of the present invention using sorted and all lists as discussed 
below with reference to FIG. 30. If access to a collection is requested at 
block 801, the memory controller can first check the all list pointer at block 
803, and if the all list pointer identifies an all list entry corresponding to 
the requested collection at block 805, the data can be retrieved at block 821. 
If the all list pointer does not identify an all list entry corresponding to 
the requested collection at block 805, the all list can be scanned at block 
807. If an entry in the all list corresponds to the requested collection at 
block 809, the all list pointer can be set to the entry corresponding to the 
requested collection at block 819, and the data can be retrieved at block 821. 
If the all list does not include an entry for the requested collection at block 
809, the sorted list can be scanned at block 811. If an entry in the sorted 
list corresponds to the requested collection at block 813, a fragment list for 
the requested collection can be generated at block 815, an entry for the 
requested collection can be added to the all list at block 817, the all list 
pointer can be set to the new all list entry at block 819, and the data can be 
retrieved at block 821. Because a collection may be accessed repeatedly, 
access times can be reduced by first checking the all list pointer that 
identifies the last accessed collection and then scanning the all list that 
identifies a plurality of the most recently accessed collections before 
scanning the sorted list of all collections. 

Detail Description Paragraph - DETX (129) : 

[0155] Like an index, a record set can include a list of record numbers for 
a table in order of a sorting criteria without including actual record data. 
In addition, a record set may be assembled manually by inserting specific 
record numbers in a specific order. A record set may also be created as a 
snapshot of an index. Unlike an index, a record set is stored in non-volatile 
memory for future use. 
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